A Learner-Verifier Framework for Neural Network Controllers and Certificates of Stochastic Systems
نویسندگان
چکیده
Abstract Reinforcement learning has received much attention for controllers of deterministic systems. We consider a learner-verifier framework stochastic control systems and survey recent methods that formally guarantee conjunction reachability safety properties. Given property lower bound on the probability being satisfied, our jointly learns policy formal certificate to ensure satisfaction with desired threshold. Both are continuous functions from states reals, which learned as parameterized neural networks. While in case, certificates invariant barrier safety, or Lyapunov ranking liveness, case supermartingales. For verification, we use interval arithmetic abstract interpretation expected values network functions.
منابع مشابه
scour modeling piles of kambuzia industrial city bridge using hec-ras and artificial neural network
today, scouring is one of the important topics in the river and coastal engineering so that the most destruction in the bridges is occurred due to this phenomenon. whereas the bridges are assumed as the most important connecting structures in the communications roads in the country and their importance is doubled while floodwater, thus exact design and maintenance thereof is very crucial. f...
network of phonological rules in lori dialect of andimeshk: a study within the framework of post-generative approach.
پژوهش حاضر ارائه ی توصیفی است از نظام آوایی گویش لری شهر اندیمشک، واقع در شمال غربی استان خوزستان. چهارچوب نظری این پژوهش، انگاره ی پسازایشی جزءمستقل می باشد. این پایان نامه شامل موارد زیر است: -توصیف آواهای این گویش به صورت آواشناسی سنتی و در قالب مختصه های زایشی ممیز، همراه با آوانوشته ی تفصیلی؛ -توصیف نظام آوایی گویش لری و قواعد واجی آن در چهارچوب انگاره ی پسازایشی جزءمستقل و معرفی برهم کن...
Adaptive Predictive Controllers Using a Growing and Pruning RBF Neural Network
An adaptive version of growing and pruning RBF neural network has been used to predict the system output and implement Linear Model-Based Predictive Controller (LMPC) and Non-linear Model-based Predictive Controller (NMPC) strategies. A radial-basis neural network with growing and pruning capabilities is introduced to carry out on-line model identification.An Unscented Kal...
متن کاملa framework for identifying and prioritizing factors affecting customers’ online shopping behavior in iran
the purpose of this study is identifying effective factors which make customers shop online in iran and investigating the importance of discovered factors in online customers’ decision. in the identifying phase, to discover the factors affecting online shopping behavior of customers in iran, the derived reference model summarizing antecedents of online shopping proposed by change et al. was us...
15 صفحه اولNeural network-based quality controllers for manufacturing systems
This paper demonstrates that neural networks can be used e ectively for quality control of non-linear static time-variant processes where the process physics and mechanistic models are not well understood. The emphasis of the paper is on models for both identi® cation and real-time process parameter design of manufacturing systems. Both multi-layer feed-forward perceptron networks and radial b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2023
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-30823-9_1